Consistency of Sequential Bayesian Sampling Policies

Authors

Peter I. Frazier

Warren B. Powell

Abstract

We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes ranking and selection, continuous global optimization, and many other problems in sequential experimental design. We give a sufficient condition under which measurement policies sample each measurement type infinitely often, ensuring consistency, i.e., that a globally optimal future decision is found in the limit. This condition is useful for verifying consistency of adaptive sequential sampling policies that do not do forced random exploration, making consistency difficult to verify by other means. We demonstrate the use of this sufficient condition by showing consistency of two previously proposed ranking and selection policies: OCBA for linear loss, and the knowledge-gradient policy with independent normal priors. Consistency of the knowledge-gradient policy was shown previously, while the consistency result for OCBA is new.
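As a rough illustration of the ranking and selection setting described in the abstract, the sketch below simulates a knowledge-gradient policy with independent normal priors and records how often each alternative is measured. It is not taken from the paper: it assumes a known, common sampling variance, uses the closed-form knowledge-gradient factor usually given for independent normal beliefs, and all names (`kg_factors`, `update`, `run_kg`, the prior values) are hypothetical choices made for this example.

```python
import math
import random


def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def kg_factors(mu, sigma2, noise_var):
    """Knowledge-gradient factor of each alternative (needs >= 2 alternatives).

    mu[x], sigma2[x]: posterior mean and variance of alternative x.
    noise_var: sampling variance, assumed known and common to all x.
    """
    factors = []
    for x in range(len(mu)):
        # Standard deviation of the change in mu[x] caused by one more sample.
        sigma_tilde = sigma2[x] / math.sqrt(sigma2[x] + noise_var)
        best_other = max(m for i, m in enumerate(mu) if i != x)
        zeta = -abs(mu[x] - best_other) / sigma_tilde
        factors.append(sigma_tilde * (zeta * normal_cdf(zeta) + normal_pdf(zeta)))
    return factors


def update(mu, sigma2, x, y, noise_var):
    """Conjugate normal update of alternative x after observing y (in place)."""
    precision = 1.0 / sigma2[x] + 1.0 / noise_var
    mu[x] = (mu[x] / sigma2[x] + y / noise_var) / precision
    sigma2[x] = 1.0 / precision


def run_kg(true_means, noise_var=1.0, n_steps=200, seed=0):
    """Simulate the policy; return posterior means, variances, sample counts."""
    rng = random.Random(seed)
    k = len(true_means)
    mu = [0.0] * k        # assumed prior means (illustrative)
    sigma2 = [100.0] * k  # assumed vague prior variances (illustrative)
    counts = [0] * k
    for _ in range(n_steps):
        factors = kg_factors(mu, sigma2, noise_var)
        x = max(range(k), key=factors.__getitem__)  # measure the largest factor
        y = rng.gauss(true_means[x], math.sqrt(noise_var))
        update(mu, sigma2, x, y, noise_var)
        counts[x] += 1
    return mu, sigma2, counts


if __name__ == "__main__":
    mu, sigma2, counts = run_kg([1.0, 2.0, 3.0])
    print("posterior means:", mu)
    print("sample counts:", counts)
```

A finite simulation like this can only suggest, not establish, that every alternative keeps being sampled; the sufficient condition described in the abstract is what gives the guarantee in the limit.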

Similar resources

Asymptotic Optimality of Sequential Sampling Policies for Bayesian Information Collection

We consider adaptive sequential sampling policies in a Bayesian framework. Under the assumptions that the sampling distribution is from an exponential family and that the number of distinct measurement types is finite, we give sufficient conditions for an adaptive sampling policy to achieve asymptotic optimality. Here, asymptotic optimality is understood to mean that the limit of the expected l...

Convergence to Global Optimality with Sequential Bayesian Sampling Policies

We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes problems in ranking and selection, reinforcement learning, and continuous global optimization. We give sufficient conditions under which measurement policies achieve asymptotically minimal expected loss. Achieving asymptotically minimal expected l...

AGM-consistency and perfect Bayesian equilibrium. Part II: from PBE to sequential equilibrium

In [6] a general notion of perfect Bayesian equilibrium (PBE) for extensive-form games was introduced and shown to be intermediate between subgame-perfect equilibrium and sequential equilibrium. Besides sequential rationality, the ingredients of the proposed notion are (1) the existence of a plausibility order on the set of histories that rationalizes the given assessment and (2) the notion of ...

Policy Explanation and Model Refinement in Decision-Theoretic Planning

Decision-theoretic systems, such as Markov Decision Processes (MDPs), are used for sequential decision-making under uncertainty. MDPs provide a generic framework that can be applied in various domains to compute optimal policies. This thesis presents techniques that offer explanations of optimal policies for MDPs and then refine decision theoretic models (Bayesian networks and MDPs) based on fe...

Online Bayesian phylogenetic inference: theoretical foundations via Sequential Monte Carlo.

Phylogenetics, the inference of evolutionary trees from molecular sequence data such as DNA, is an enterprise that yields valuable evolutionary understanding of many biological systems. Bayesian phylogenetic algorithms, which approximate a posterior distribution on trees, have become a popular if computationally expensive means of doing phylogenetics. Modern data collection technologies are qui...

Journal:

SIAM J. Control and Optimization

Volume 49, Issue

Pages -

Publication date: 2011
